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Abstract. Constant dimension codes, with a prescribed minimum distance, 
have found recently an application in network coding. All the codewords in such 
a code are subspaces of with a given dimension. A computer search for large 
constant dimension codes is usually inefficient since the search space domain 
is extremely large. Even so, we found that some constant dimension lexicodes 
are larger than other known codes. We show how to make the computer search 
more efficient. In this context we present a formula for the computation of the 
distance between two subspaces, not necessarily of the same dimension. 

1. Introduction 

Let ¥q be the finite field of size q. The set of all fc-dimensional subspaces of the 
vector space F^, for any given two nonnegative integers k and n, < k < forms 
the Grassmannian space (Grassmannian, in short) over F^, denoted by Qq{n,k). 
The Grassmannian space is a metric space, where the subspace distance between 
any two subspaces X and Y in Gq{n, k), is given by 

(1) dsiX,Y) =^ dimX + dimF - 2dim(Xny). 

This is also the definition for the distance between two subspaces of F^' which are 
not of the same dimension. 

We say that C C Qq{n, k) is an (n, M, d, k)q code in the Grassmannian, or 
constant- dimension code, if AI = |C| and ds{X,Y) > d for all distinct elements 
X,Y € C. The minimum distance of C, ds{C), is d. 

Koettcr and Kschischang [10] presented an application of error-correcting codes 
in Gq{n, k) to random network coding. This led to an extensive research for con- 
struction of large codes in the Grassmannian. Constructions and bounds for such 
codes were given in [4, 5, 9, 10, 11, 15, 16, 17]. 

The motivation for this work is an (8,4605,4,4)2 constant dimension lexicode 
constructed in [15] which is larger than any other known codes with the same 
parameters. 

Lexicographic codes, or lexicodes, are greedily generated error-correcting codes 
which were first developed by Levinshtein [12], and rediscovered by Conway and 
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Sloane [2]. The construction of a lexicode with a minimum distance d starts with 
the set S = {So}, where 5*0 is the first element in a lexicographic order, and greedily 
adds the lexicographically first element whose distance from all the elements of S 
is at least d. In the Hamming space, the lexicodes include the optimal codes, such 
as the Hamming codes and the Golay codes. 

To construct a lexicode, we need first to define some order of all subspaccs in the 
Grassmannian. The (8,4605,4,4)2 lexicode found in [15] is based on the Ferrers 
tableaux form representation of a subspace. First, for completeness, we provide the 
definitions which are required to define the Ferrers tableaux form representation of 
subspaces in the Grassmannian, and next, we define the order of the Grassmannian 
based on this representation. 

A partition of a positive integer m is a representation of to as a sum of positive 
integers, not necessarily distinct. 

A Ferrers diagram T represents a partition as a pattern of dots with the i-th row 
having the same number of dots as the z-th term in the partition [1, 13, 18]. (In the 
sequel, a dot will be denoted by a " • "). A Ferrers diagram satisfies the following 
conditions. 

• The number of dots in a row is at most the number of dots in the previous 
row. 

• All the dots are shifted to the right of the diagram. 

The number of rows ( columns ) of the Ferrers diagram J- is the number of dots in 
the rightmost column (top row) of T. If the number of rows in the Ferrers diagram 
is TO and the number of columns is 77, we say that it is an to x 77 Ferrers diagram. 

Let X £ Gq{n, k) be a fc-dimensional subspace in the Grassmannian. We can 
represent X by the k linearly independent vectors from X which form a unique 
k X n generator matrix in reduced row echelon form (RREF), denoted by RE{X), 
and defined as follows: 

• The leading coefficient of a row is always to the right of the leading coeffi- 
cient of the previous row. 

• All leading coefficients are ones. 

• Every leading coefficient is the only nonzero entry in its column. 

For each X G Gq{n, k) we associate a binary vector of length n and weight fc, 
v{X), called the identifying vector of X, where the ones in v{X) are exactly in the 
positions where RE{X) has the leading ones. 

The echelon Ferrers form of a binary vector v of length n and weight fc, EF{v), 
is the k X n matrix in RREF with leading entries (of rows) in the columns indexed 
by the nonzero entries of v and " • " in all entries which do not have terminal zeroes 
or ones (see [4]). The dots of this matrix form the Ferrers diagram of EF{v). If 
we substitute elements of ¥q in the dots of EF{v) we obtain a generator matrix in 
RREF of a fc-dimcnsional subspace of Qq{n, k). EF{v) and F will be called also the 
echelon Ferrers form and the Ferrers diagram of such a subspace, respectively. 

The Ferrers tableaux form of a subspace X, denoted by J-{X), is obtained by 
assigning the values of RE{X) in the Ferrers diagram Fx of X. Each Ferrers 
tableaux form represents a unique subspace in Qq(n, k). 



LARGE CONSTANT DIMENSION CODES AND LEXICODES 



3 



Example 1. Let X be the subspace in Q2{1 , 3) with the foUowing generator matrix 
in RREF: 

/ 1 1 1 \ 
RE{X) =0010101. 

\ 1 1 1 y 

Its identifying vector is v{X) = 1011000, and its echelon Ferrers form, Ferrers 
diagram, and Ferrers tableaux form are given by 

"1«00»»»1 •••• 0110 
0010»»» , and 10 1, respectively . 

0001»»»J ••• Oil 

Let be a Ferrers diagram of a subspace X G Qq{n,k). T can be embed- 
ded in a fc X (n — fc) box. We represent T by an integer vector of length n — k, 
(Jvi-fc, J-2, J^i), where is equal to the number of dots in the i-ih column of 
J-, 1 < i < n — k, where we number the columns from right to left. Note that 
J'i+i ^ J'ii l<i<n — k — 1. 

To define an order of all the subspaces in the Grassmannian we need first to 
define an order of all the Ferrers diagrams embedded in the k x {n — k) box. Let 
denote the size of i.e., the number of dots in J-. For two Ferrers diagrams T 
and T, we say that < T ii one of the following two conditions holds. 

• = \ J-\, and J-i > Ti for the least index i where the two diagrams J- and 
T have a different number of dots. 

Now, we define the following order of subspaces in the Grassmannian based on 
the Ferrers tablearrx form representation. Let X, K £ Qq(n, k) be two fc-dimensional 
subspaces, and RE{X), RE{Y) their related RREFs. Let v{X), v(V) be the 
identifying vectors of X, Y, respectively, and J^y the Ferrers diagrams of 

EF{v{X)), £'i^(7;(F)), respectively. Let xi, X2, x^jr^^ and yi, 2/2, y|j=-y | be the 
entries of Ferrers tableaux forms J^(X) and J-{Y), respectively. The entries of a 
Ferrers tableaux form are numbered from right to left, and from top to bottom. 

We say that X <Y ii one of the following two conditions holds. 

• Tx < Ty\ 

• Tx^ J"y, and ixx^x^, ...,x\jr^\) < {yi,y2, ■■■,y\jry\). 



Example 2. Let X, Y,Z,W e ^2(6, 3) be given by 

111 101 111 111 

jr(x) =111, JP(y) = 00, TiZ) = 11, TiW) = 11. 

1 11 1 

By the definition, wc have that J-y < J-x < J'z = J'w- Since (zi, Z2, ~ 
(1, 1,0, 1, 1, 1) < (u)!, ...,'u;|^„-|) = (1, 1, 1, 1, 1, 1), it follows that Y < X < Z <W. 

The construction of lexicodes involves many computations of the distance be- 
tween two subspaces of Qq{n, k). In Section 2 we develop a new formula for compu- 
tation of the distance between two subspaces not necessarily of the same dimension. 
This formula will enable a faster computation of the distance between any two sub- 
spaces of Gq(n, fc). In Section 3 we examine several properties of constant dimension 
codes which will enable to simplify the computer search for large lexicodes. In Sec- 
tion 4 we describe a general search method for constant dimension lexicodes. We 
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also present some improvements on the sizes of constant dimension codes. In Sec- 
tion 5 we summarize our results and present several problems for further research. 

2. Computation of Distance between Subspaces 

The research on error-correcting codes in the Grassmannian in general and on 
the search for related Icxicodcs in particular requires many computations of the 
distance between two subspaces in the Grassmannian. We will examine a more 
general computation problem of the distance between any two subspaces X,Y C 
which do not necessarily have the same dimension. The motivation is to simplify the 
computations that lead to the next subspace which will be joined to the lexicode. 

Let A* B denotes the concatenation ^ ^ ^ of two matrices A and B with the 

same number of columns. By the definition of the subspace distance (1), it follows 
that 

(2) ds{X, r) = 2 mnk{RE{X) * RE{Y)) - Tank{RE{X)) - Tank(RE{Y)). 

Therefore, the calculation of ds{X,Y) can be done by using Gauss elimination. 
In this section we present an improvement on this calculation by using the represen- 
tation of subspaces by Ferrers tableaux forms, from which their identifying vectors 
and their RREF are easily determined. We will present an alternative formula for 
the computation of the distance between two subspaces X and Y . 

For X e Gqin, ki) and Y £ Qq(n, fe), let p{X, Y) [//(X, 1")] be a set of indices (of 
coordinates) with common zeroes [ones] in v{X) and viY)^ i.e., 

p{X, Y) ^ {z| v{X\ = and viY), = 0} , 

and 

p(X, Y) = {i\ v{X), = 1 and v{Y), = 1} . 
Note that \p{X,Y)\ + \piX,Y)\+dHiv{X),v{Y)) = n, where dni-, ■) denotes the 
Hamming distance, and 

(3) |M(x,r)| = ^^ + ^^-^"^(-(^)'-(^)). 

Let Xfj_ be the y)| x n sub-matrix of RE{X) which consists of the rows 

with leading ones in the columns related to (indexed by) p,{X, Y). Let X^c be the 
{ki — \p{X, Y)\) X n sub-matrix of RE{X) which consists of all the rows of RE{X) 
which are not contained in X^. Similarly, let be the |/Lt(X, y)| x n sub-matrix 
of RE{Y) which consists of the rows with leading ones in the columns related to 
p,{X,Y). Let Y^c be the (fc2 — |/i(X, y)|) x n sub-matrix of RE{Y) which consists 
of all the rows of RE(Y) which arc not contained in Y"^. 

Let Xfj_ be the |^(X, y)| x n sub-matrix of RE{RE{X) * Y^c) which consists 
of the rows with leading ones in the columns indexed by p{X, Y). Intuitively, 
obtained by concatenation of the two matrices, RE{X) and y^c, and "cleaning" 
(by adding the corresponding rows of Y^c) all the nonzero entries in columns of 
RE{X) indexed by leading ones in Y^c . Finally, is obtained by taking only the 
rows which are indexed by ii{X,Y). Thus, X^ has all-zeroes columns indexed by 
ones of v{Y) and v{X) which arc not in /i(X, y). Hence X^ has nonzero elements 
only in columns indexed by p{X, Y) U /i(X, Y). 

Let y^ be the |^(X, y)| x n sub-matrix of RE{RE{Y) * X^c) which consists 
of the rows with leading ones in the columns indexed by p{X,Y). Similarly to 
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X^, it can be verified that has nonzero elements only in columns indexed by 
piX,Y)U^iiX,Y). 

Corollary 1. Nonzero entries in X^ — Y^ can appear only in columns indexed by 
piX,Y). 

Proof. An immediate consequence since the columns of X^ and Y,, indexed by 
fi{X,Y) form a \fi{X,Y)\ x \ii{X,Y)\ identity matrix. □ 

Theorem 2.1. 

(4) ds{X,Y) = dHiv{X),v{Y)) +2TankiX^ -Y^). 
Proof. By (2) it is sufficient to proof that 

(5) 2Tank{RE{X) * RE{Y)) ^ ki + k2 + dHiv{X),v{Y)) + 2rank(X^ - 
It is easy to verify that 

rank ( '^^{'^J I = rank | Y,,c 1 = rank | Y... 



RE{Y) 



(6) ^ rank ( ^^^^^g^ ^ * ^^^^ U rank ( ^^^^^^^^ * ^^^^ 



We note that the positions of the leading ones in all the rows of RE{X) * Y^c 
are in {1, 2, . . . , n} \ p{X, Y). By Corollary 1 the positions of the leading ones of 
all the rows of RE{Y^ — X^) arc in p{X,Y). Thus, by (6) we have 

(7) Tm\k{RE{X) * RE{Y)) ^T{mk{RE{RE{X) * Y^c) + rank(i;, - X^). 

Since the sets of positions of the leading ones of RE{X) and Y^c arc disjoint, 
we have that rank(i?£;(X) * Y^c) = fci + (fc2 - \p{X,Y)\), and thus by (7) 

(8) rank(i?£;(X) * RE{Y)) =ki + fca - HX, Y)\ + rank(y^ - X^). 
Combining (8) and (3) we have 

2TSink{RE{X) * RE{Y)) = fci + fcz + + 2 rank(f^ - X^), 

and by (5) this proves the theorem. 

□ 

Corollary 2. For any two subspaces X,Y C 

dsiX,Y)>dH{viX),viY)). 

Corollary 3. Let X and Y be two subspaces such that v{X) = v{Y). Then 

ds{X, r) = 2 T!mk{RE{X) - RE{Y)). 

In the sequel we will show how we can use these results to make the search of 
Icxicodcs more efficient. 
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3. Analysis of Constant Dimension Codes 

In this section we consider some properties of constant dimension codes which 
will help us to simplify the search for lexicodcs. First, we introduce the multilevel 
structure of a code in the Grassmannian. 

All the binary vectors of the length n and weight k can be considered as the 
identifying vectors of all the subspaces in Gq{n,k). These (^) vectors partition 
Gq{n, k) into the (^) different classes, where each class consists of all subspaces in 
Qq{n, k) with the same identifying vector. These classes are called Schubert cells [7, 
p. 147]. Note that each Schubert cell contains all the subspaces with the same given 
echelon Ferrers form. 

According to this partition all the constant dimension codes have a multilevel 
structure: we can partition all the codewords of a code into different classes (sub- 
codes), each of which have the same identifying vector. Therefore, the first level of 
this structure is the set of different identifying vectors, and the second level is the 
subspaces corresponding to these vectors. 

Let C C Qq(ri,k) be a constant dimension code, and let {wi, W2, . . . , w*} be all 
the different identifying vectors of the codewords in C. Let {Ci, C2, . . . , Ct} be the 
partition of C into t sub-codes induced by these t identifying vectors, i.e., v{X) = w^, 
for each X ^ Ci, 1 < i < t. 

Remark 1. We can choose any constant weight code C with minimum Hamming 
distance d to be the set of identifying vectors. If for each identifying vector u G C we 
have a sub-code Cy for which v{X) = v for each X € Cy, and dsiCy) = d, then by 
Corollary 2 we obtain a constant dimension code with the same minimum distance d. 
If for all such identifying vectors we construct the maximum size constant dimension 
sub-codes then we obtain the multilevel construction (ML construction, in short) 
which was described in [4] . One question that arises in this context is how to choose 
the best constant weight code for this ML construction. 

To understand the structure of a sub-code formed by some Ferrers diagram in- 
duced by an identifying vector, we need the following definitions. 

For two m X 77 matrices A and B over the rank distance, dii{A, B), is defined 

by 

dR{A,B) =^rank(yl-B) . 

A code C is an [m x 77, g, S] rank-metric code if its codewords are mx rj matrices over 
¥q, they form a linear subspace of dimension g of F™^**, and for each two distinct 
codewords A and B we have that dii{A, B) > 6. For an [m x 77, g, S] rank-metric code 
C we have g < min{m(?7 — 5-1-1), r]{m — S+l)} (see [3, 8, 14]). This bound is attained 
for all possible parameters and the codes which attain it are called maximum rank 
distance codes (or MRD codes in short). 

Let V be a vector of length n and weight k and let EF{v) be its echelon Ferrers 
form. Let T be the Ferrers diagram of EF[v). is an tti x 77 Ferrers diagram, 
m < k, r] < n — k. A code C is an [J-, g, 6] Ferrers diagram rank-metric code if all 
codewords of C are m x rj matrices in which all entries not in J- are zeroes, it forms 
a rank-metric code with dimension g, and minimum rank distance S. Let dim(J^, 5) 
be the largest possible dimension of an [J^, g, S] code. The following theorem [4] 
provides an upper bound on the size of such codes. 
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Theorem 3.1. For a given i, < i < S — 1, if vt is the number of dots in T , which 
are not contained in the first i rows and are not contained in the rightmost S — 1 — i 
columns, then imiii{i'i} is an upper bound on dim(J-", 5). 

It is not known whether the upper bound of Theorem 3.1 is attained for all 
parameters. A code which attains this bound, wiU be called an MRD (Ferrers 
diagram) code. This definition generalizes the previous definition of MRD codes, 
and a construction of such codes is given in [4]. 

Without loss of generality we will assume that k < n — k. This assumption can 
be justified as a consequence of the following lemma [5]. 

Lemma 3.2. If C is an {n, M, d, k)q constant dimension code then = {X^ : 
X G C}, where X^ is the orthogonal subspace of X , is an (n, M, d,n — k)q constant 
dimension code. 

For X g Gq{n, k), we define the k x [n — k) matrix R{X) as the sub-matrix of 
RE(X) with the columns which are indexed by zeroes of v{X). By Corollary 3, for 
any two codewords X,Y G Ci, Ci C C, 1 < i < t, the subspace distance between X 
and Y can be calculated in terms of rank distance, i.e., 

dsiX,Y)^2dR{R{X),R{Y)). 

For each sub-code C C, 1 < z < t, we define a Ferrers diagram rank-metric 
code 

1^' {RiX) : X G Q}. 

Note, that such a code is obtained by the inverse operation to the lifting operation, 
defined in [16]. Thus, R{<Ci) will be called the unlifted code of the sub-code Ci. 

We define the subspace distance between two sub-codes C;, Cj of C, 1 < i j < t 
as follows: 

ds(Q,C,) = uAn{ds{X,Y) : X e Q,r G C,}. 

By Corollary 2, 

c^s(Q,Cj) > dH{vi,Vj). 

The following lemma shows a case in which the last inequality becomes an equal- 
ity. 

Lemma 3.3. Let <Ci and Cj be two different sub-codes of C C Qq[n.,k), each one 
contains the subspace whose RREF is the corresponding column permutation of the 
matrix (/fcOfcx(ri-fc)); where Ik denotes the k x k identity matrix and Oaxb denotes 
an a X b allzero matrix. Then 

ds{Ci,Cj) = dH{vi,Vj). 

Proof. Let X € Ci and Y G Cj be subspaces whose RREF equal to column permu- 
tation of the matrix {IkOkx(n-k))- It is easy to verify that 

(9) rank ) = rank Y^c ^ = rank ^^^^ ) . 

Clearly, rank(y,,c) = ild^^^ and hence, rank(i?£;(X) * RE{Y)) = k+ M^iM. 
By (2), dsiX,Y) = 2i'Ank{RE{X)*RE{Y))-2k = 2k + dHivi,Vj)-2k = dniv.^Vj), 
i.e., dsiCijCj) < dnivijVj). By Corollary 2, ^^(CijCj) > dH{vi,Vj), and hence, 
ds{Ci,Cj) ^ dH{vi,Vj). □ 
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Corollary 4. Let Vi and Vj be two identifying vectors of codewords in an (n, M, d, k)q 
code C. If dnivi^Vj) < d then at least one of the corresponding sub-codes, Ci and 
Cj , does not contain the subspace with RREF which is a column permutation of the 
matrix {IkOkx{n-k))- I^i other words, the corresponding unlifted code is not linear 
since it does not contain the allzero codeword. 

Assume that we can add codewords to a code C, c?s(C) = d, constructed by the 
ML construction with a maximal constant weight code (for the identifying vectors) 
C, dniO) = d. Corollary 4 implies that any corresponding unlifted Ferrers diagram 
rank-metric code of any new identifying vector will be nonlinear. 

The next two lemmas reduce the search domain for constant dimension lexicodcs. 

Lemma 3.4. Let C be an (n, M, d = 26, k)q constant dimension code. Let Ci C C, 
v(X) = wi = 11 . . . 100 ... for each X Ci, be a sub-code for which i?(Ci) attains 



the upper bound of Theorem 3.1, i.e., |Ci| ~ |i?(Ci) 
is no codeword Y in C such that dH{v{Y),vi) < d. 



(fc_5 + l)(„_fc)_ rj,^^^ ^^g^g 



Proof. Let C be a given {n,M,d = 2d,k)q constant dimension code. Since the 
minimum distance of the code is d, the intersection of any two subspaces in C is at 
most of dimension k — ^ = k — S. Therefore, a subspace of dimension k — S -\- 1 can 
be contained in at most one codeword of C. 
Wc define the following set of subspaces: 

{X e gq{n, fc - J + 1) : supp{v{X)) C supp{vi)}, 

where supp{v) is as the set of nonzero entries in v. Each codeword of the sub-code 
k 



-5 + 1 



subspaces of dimension k — 6 + 1, and all subspaces of 



Ci contains 

dimension k — 6 + 1 which are contained in codewords of Ci are in A. Since |Ci 

k 



qik-s+i)in-k) ^ it follows that Ci contains qik-s+i)in-k) 
of A. 

Now we calculate the size of A. First we observe that 



k~S + l 



subspaces 



A = {X e Gqin, fc-(5-M) : v{X) = ab, \a\ = k, \b\ = n-k, w{a) ^ k-S+l, w{b) = 0}, 

where \v\ and w{v) are the length and the weight of a vector v, respectively. Thus 
EF{v{X)) of each v{X) = ab, such that X e A, has the form 



(10) 



EF{v{X)) 



EF{a) 



The number of dots in (10) is {k — 5 + \){n — k) , and the size of the following set 
{EF{a) : \a\ = k, w{a) = k-6 + l} 



IS 



k 

k-5- 



Therefore, \A\ 



k 

-S + l 



. q{k s+i){n fc) Hence, each sub- 



space of A is contained in some codeword from Ci. A subspace Y G Gqin, k) with 
dH{v{Y), vi) = 26 — 2i, 1 < i < 6 — 1, contains some subspaces of A, and therefore, 
Y ^C. □ 

Lemma 3.5. Let C be an {n,M,d ^ 26,k)q constant dimension code, where 6 — 
1 < k — 6. Let C2 be a sub- code of C which corresponds to the identifying vector 
V2 = abfg, where a = 11 ... 1, & = 00 ... 0, / = 11 ... 1, and g = 00 ... 0. Assume 
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further that i?(C2) attains the upper bound of Theorem 3.1, i.e., IC2I = |i?(C2)I ~ 
q{k-s+i){n-k)-5\ rpf^^^ ^^g^g codeword Y €C with v{Y) = a'b'fg', \a'b'\ ^ k, 
\g'\ ~ n — k — 5 , such that d// (w(Y), < d. 

Proof. Similarly to the proof of Lemma 3.4, we define the following set of subspaces: 
B = {X e Gqin, k-6 + 1): v{X) = a"bfg with \a"\ ^ k - S, w{a") =k-26+l}. 



As in the previous proof, we can sec that C2 contains g'^''' ■ 



k~8 
fc - 25 + 1 



subspaces of B. In addition, \B\ 



k~6 
k-25 + 1 



■ q 



k-5 
k-25 + 1 



■ q 



(fc-a+l)(n-fc)-5^ 



Thus each subspace in B is contained in some 



codeword from C2. A subspace Y G Qq{n,k), such that v{Y) = a'b'fg' {\a'b'\ = k, 
\g'\ = n — k — 6), with W2) = 26 — 2i, 1 < i < 6 — 1, contains some subspaces 



of B, and therefore, Y 



□ 



4. Search for Constant Dimension Lexicodes 

In this section we describe our search method for constant dimension lexicodes, 
and present some resulting codes which are the largest currently known constant 
dimension codes for their parameters. 

To search for large constant dimension code we use the multilevel structure of 
such codes, described in the previous section. First, we order the set of all binary 
words of length n and weight k by an appropriate order. The words in this order 
are the candidates to be the identifying vectors of the final code. In each step of the 
construction we have the current code C and the set of subspaces not examined yet. 
For each candidate for an identifying vector v taken by the given order, we search for 
a sub-code in the following way: for each subspace X (according to the lexicographic 
order of subspaces associated with v) with the given Ferrers diagram we calculate 
the distance between X and C, and add X to C if this distance is at least d. By 
Theorem 2.1 and Corollary 2 it follows that in this process, for some subspaces it is 
enough only to calculate the Hamming distance between the identifying vectors in 
order to determine a lower bound on the subspace distance. In other words, when 
we examine a new subspace to be inserted into the lexicode, we first calculate the 
Hamming distance between its identifying vector and the identifying vector of a 
codeword, and only if the distance is smaller than d, we calculate the rank of the 
corresponding matrix, (see (4)). Moreover, by the multilevel structure of a code, 
we need only to examine the Hamming distance between the identifying vectors 
of representatives of sub-codes, say the first codewords in each sub-code. This 
approach will speed up the process of the code generation. 

A construction of constant dimension lexicodes based on the Ferrers tableaux 
form ordering of the Grassmannian was give in [15]. Note that in this construction 
we order the identifying vectors by the sizes of corresponding Ferrers diagrams. 
The motivation is that usually a larger diagram contributes more codewords than 
a smaller one. 

Example 3. Table 1 shows the identifying vectors and the sizes of corresponding 
sub-codes in the (8, 4605, 4, 4)2 lexicode, C''^^ (see [15]), and the (8, 4573, 4, 4)2 code, 
C*^^, obtained by the ML construction [4]. 
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Table 1. C^""^ vs. C^^^ in 02(8,4) with 4 



i 


id. vector Vi 


size of C^"^ 


Size of Cf'^ 


1 


11110000 


4096 


4096 


2 


11001100 


256 


256 


3 


10101010 


64 


64 


4 


10011010 


16 


- 


5 


10100110 


16 




6 


00111100 


16 


16 


7 


01011010 


16 


16 


8 


01100110 


16 


16 


9 


10010110 


16 


16 


10 


01101001 


32 


32 


11 


10011001 


16 


16 


12 


10100101 


16 


16 


13 


11000011 


16 


16 


14 


01010101 


8 


8 


15 


00110011 


4 


4 


16 


00001111 


1 


1 



We can see that these two codes have the same identifying vectors, except for 
two vectors 10011010 and 10100110 in the lexicode C'*^^ which form the difference 
in the size of these two codes. In addition, there are several sub-codes of C'^'^ for 
which the corresponding unhfted codes are nonlinear: C'f^, C'^^, Clf^ , 0^^, 'C'fi, 
and ■ However, all these unlifted codes are cosets of hnear codes. 



In general, not all unlifted codes of lexicodes based on the Ferrers tableaux form 
representation are linear or cosets of some linear codes. However, if we construct 
a binary constant dimension lexicode with only one identifying vector, the unlifted 
code is always linear. This phenomena can be explained as an immediate conse- 
quence from the main theorem in [19]. However, it does not explain why some 
of unlifted codes in Example 3 are cosets of linear codes, and why Cg*^^ is linear 
(dff(w5,W9) < 4)? 

Based on Theorem 2.1, Lemmas 3.4, and 3.5, we suggest an improved search of 
a constant dimension {n, M,d,k)q code, which will be called a lexicode with a seed. 

In the first step we construct a maximal sub-code Ci which corresponds to the 
identifying vector 1 1 . 100 . . This sub-code corresponds to the largest Ferrers 

k n — k 

diagram. In this step we can take any known [k x {n — k), [n — k){k — f + 1), |] 
MRD code (e.g. [8]) and consider its codewords as the unlifted codewords (Ferrers 
tableaux forms) of Ci. 

In the second step we construct a sub-code C2 which corresponds to the identi- 
fying vector 11 . 100 - ■ Qll — ■ ipO — ■ Q - According to Lemma 3.4, we cannot use 

k—5 5 5 n—k—S 

identifying vectors with larger Ferrers diagrams (except for the identifying vector 
1 1 . 100 . already used). If there exists an MRD (Ferrers diagram) code with 

k n — k 

the corresponding parameters, we can take any known construction of such code 
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(see in [4]) and build from it the corresponding sub-code. If a code which attains 
the bound of Theorem 3.1 is not known, we take the largest known Ferrers diagram 
rank-metric code with the required parameters. 

In the third step we construct the other sub-codes, according to the lexicographic 
order based on the Ferrers tableaux form representation. We first calculate the Ham- 
ming distance between the identifying vectors and examine the subspace distance 
only of subspaces which are not pruned out by Lemmas 3.4 and 3.5. 

Example 4. Let n=10,fc = 5,d = 6, and q = 2. By the construction of a 
lexicode with a seed we obtain a constant dimension code of size 32890. (A code of 
size 32841 was obtained by the ML construction [4]). 

Example 5. Let n = 7, fc = 3, d = 4, and q = i. By the construction a lexicode 
with a seed we obtain a constant dimension code of size 6691. (A code of size 6685 
was obtained by the ML construction [4]). 

We introduce now a variant of the construction of a lexicode with a seed. As a 
seed we take a constant dimension code obtained by the ML construction [4] and 
try to add some more codewords using the lexicode construction. Similarly, we can 
take as a seed any subset of codewords obtained by any given construction and to 
continue by applying the lexicode with a seed construction. 

Example 6. Let n = 8, /c = d = 4, and g = 2. We take the (8,4573,4,4)2 code 
obtained by the ML construction (see Table 1) and then continue with the lexicode 
construction. The size of the resulting code is 4589 (compared to C'*^^ of size 4605 
in Table 1), where there are two additional sub-codes of size 8 which correspond to 
identifying vectors 10011010 and 10100110. 

Example 7. Let n = 9, /c = = 4, and q = 2. Let C be a (9, 2^^ + 2" + 2^ 4, 4)2 
code obtained as follows. We take three codes of sizes 2^^, 2^^, and 2^, corresponding 
to identifying vectors 111100000, 110011000, and 110000110, respectively, and then 
continue by applying the lexicode with a seed construction. For the identifying 
vector 111100000 we can take as the unlifted code, any code which attains the 
bound of Theorem 3.1. To generate the codes for the last two identifying vectors 
with the corresponding unlifted codes (which attains the bound of Theorem 3.1), 
we permute the order of entries in the Ferrers diagrams and apply the lexicode 
construction. The Ferrers diagrams which correspond to the identifying vector 
110011000 and 110000110 arc 



respectively. The coordinates' order of their entries (defined in the Introduction) is: 

15 13 9 5 1 11 9 7 5 1 

16 14 10 6 2 12 10 8 6 2 

11 7 3 ' 3 ' 

12 8 4 4 
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respectively. The order of the coordinates that we use to form an MRD code 
(lexicode) is 

11 7531 97531 
15 12 8 2 4 11 10 8 2 4 
13 9 6 ' 6 ■ 

16 14 10 12 
As a resuh, we obtain a code of size 37649 which is the largest known constant 
dimension code with these parameters. 

Remark 2. One of the most interesting questions, at least from a mathematical 
point of view, is the existence of a (7, 381, 4, 3)2 code C [6]. If such code exists one 
can verify that it contains 128 codewords with the identifying vector 1110000 which 
is half the size of the corresponding MRD code. It suggests that the unlifted Ferrers 
diagram rank-metric code of the largest Ferrers diagram is not necessarily an MRD 
code, in the largest constant dimension code with given parameters n, k, and d. 

5. Conclusion and Open Problems 

We have described a search method for constant dimension codes based on their 
multilevel structure. Some of the codes obtained by this search are the largest 
known constant dimension codes with their parameters. We described several ideas 
to make this search more efficient. In this context a new formula for computation 
of the subspace distance between two subspaces of F^'' is given. It is reasonable 
to believe that the same ideas will enable to improve the sizes of the codes with 
parameters not considered in our examples. We hope that a general mathematical 
technique to generate related codes with larger size can be developed based on our 
discussion. Our discussion raises several more questions for future research: 

(1) Is the upper bound of Theorem 3.1 on the size of Ferrers diagram rank- 
metric code is attainable for all parameters? 

(2) What is the best choice of identifying vectors for constant dimension lexi- 
code in general, and for the the ML construction in particular? 

(3) Can every MRD Ferrers diagram code be generated as a lexicode by using 
a proper permutation on the coordinates (see Example 7)? 

(4) Is there an optimal combination of linear Ferrers diagram rank-metric codes 
and cosets of linear Ferrers diagram rank-metric codes to form a large con- 
stant dimension code? 

(5) For which n and k there exists an order of all identifying vectors such that 
all the unlifted codes (of the lexicode) will be either linear or cosets of linear 
codes (see Example 3). 
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